Search CORE

8 research outputs found

Data Leak Detection As a Service: Challenges and Solutions

Author: Shu Xiaokui
Yao Danfeng (Daphne)
Publication venue
Publication date: 01/01/2012
Field of study

We describe a network-based data-leak detection (DLD) technique, the main feature of which is that the detection does not require the data owner to reveal the content of the sensitive data. Instead, only a small amount of specialized digests are needed. Our technique – referred to as the fuzzy fingerprint – can be used to detect accidental data leaks due to human errors or application flaws. The privacy-preserving feature of our algorithms minimizes the exposure of sensitive data and enables the data owner to safely delegate the detection to others.We describe how cloud providers can offer their customers data-leak detection as an add-on service with strong privacy guarantees. We perform extensive experimental evaluation on the privacy, efficiency, accuracy and noise tolerance of our techniques. Our evaluation results under various data-leak scenarios and setups show that our method can support accurate detection with very small number of false alarms, even when the presentation of the data has been transformed. It also indicates that the detection accuracy does not degrade when partial digests are used. We further provide a quantifiable method to measure the privacy guarantee offered by our fuzzy fingerprint framework

Computer Science Technical Reports @Virginia Tech

POIROT: Aligning Attack Behavior with Kernel Audit Records for Cyber Threat Hunting

Author: Antonakakis Manos
Antonakakis Manos
Gallagher Brian
Gao Peng
Gao Peng
Goel A.
Hossain Md Nahid
Hossain Md Nahid
Ji Yang
Ma Shiqing
Milajerdi Sadegh M.
Milajerdi Sadegh M.
Shu Xiaokui
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 30/09/2019
Field of study

Cyber threat intelligence (CTI) is being used to search for indicators of attacks that might have compromised an enterprise network for a long time without being discovered. To have a more effective analysis, CTI open standards have incorporated descriptive relationships showing how the indicators or observables are related to each other. However, these relationships are either completely overlooked in information gathering or not used for threat hunting. In this paper, we propose a system, called POIROT, which uses these correlations to uncover the steps of a successful attack campaign. We use kernel audits as a reliable source that covers all causal relations and information flows among system entities and model threat hunting as an inexact graph pattern matching problem. Our technical approach is based on a novel similarity metric which assesses an alignment between a query graph constructed out of CTI correlations and a provenance graph constructed out of kernel audit log records. We evaluate POIROT on publicly released real-world incident reports as well as reports of an adversarial engagement designed by DARPA, including ten distinct attack campaigns against different OS platforms such as Linux, FreeBSD, and Windows. Our evaluation results show that POIROT is capable of searching inside graphs containing millions of nodes and pinpoint the attacks in a few minutes, and the results serve to illustrate that CTI correlations could be used as robust and reliable artifacts for threat hunting.Comment: The final version of this paper is going to appear in the ACM SIGSAC Conference on Computer and Communications Security (CCS'19), November 11-15, 2019, London, United Kingdo

arXiv.org e-Print Archive

Crossref

DECT: Distributed Evolving Context Tree for Understanding User Behavior Pattern Evolution

Author: Laptev Nikolay
Shu Xiaokui
Yao Danfeng
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 05/03/2016
Field of study

Internet user behavior models characterize user browsing dynamics or the transitions among web pages. The models help Internet companies improve their services by accurately targeting customers and providing them the information they want. For instance, specific web pages can be customized and prefetched for individuals based on sequences of web pages they have visited. Existing user behavior models abstracted as time-homogeneous Markov models cannot efficiently model user behavior variation through time. This demo presents DECT, a scalable time-variant variable-order Markov model. DECT digests terabytes of user session data and yields user behavior patterns through time. We realize DECT using Apache Spark and deploy it on top of Yahoo! infrastructure. We demonstrate the benefits of DECT with anomaly detection and ad click rate prediction applications. DECT enables the detection of higher-order path anomalies and provides deep insights into ad click rates with respect to user visiting paths

Association for the Advancement of Artificial Intelligence: AAAI Publications

Anomaly detection as a service: challenges, advances, and opportunities

Author: Bertino Elisa
Cheng Long
Sandhu Ravi
Shu Xiaokui
Stolfo Salvatore J
Yao Danfeng (Daphne)
Publication venue: 'Morgan & Claypool Publishers LLC'
Publication date: 01/01/2017
Field of study

CERN Document Server

DECANTeR: DEteCtion of Anomalous outbouNd HTTP Traffic by Passive Application Fingerprinting

Author: Bartos Karel
Gu Guofei
Gu Guofei
Hart Michael
Kheir Nizar
Nelms Terry
Perdisci Roberto
Shu Xiaokui
Venkataraman Shobha
Xie Guowu
Zarras Apostolis
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2017
Field of study

We present DECANTeR, a system to detect anomalous outbound HTTP communication, which passively extracts fingerprints for each application running on a monitored host. The goal of our system is to detect unknown malware and backdoor communication indicated by unknown fingerprints extracted from a host's network traffic. We evaluate a prototype with realistic data from an international organization and datasets composed of malicious traffic. We show that our system achieves a false positive rate of 0.9% for 441 monitored host machines, an average detection rate of 97.7%, and that it cannot be evaded by malware using simple evasion techniques such as using known browser user agent values. We compare our solution with DUMONT [24], the current state-of-The-Art IDS which detects HTTP covert communication channels by focusing on benign HTTP traffic. The results show that DECANTeR outperforms DUMONT in terms of detection rate, false positive rate, and even evasion-resistance. Finally, DECANTeR detects 96.8% of information stealers in our dataset, which shows its potential to detect data exfiltration.Cyber Securit

Crossref

TU Delft Repository

University of Twente Research Information

DECANTeR: DEteCtion of Anomalous outbouNd HTTP TRaffic by Passive Application Fingerprinting

Author: Bartos Karel
Gu Guofei
Gu Guofei
Hart Michael
Kheir Nizar
Nelms Terry
Perdisci Roberto
Shu Xiaokui
Venkataraman Shobha
Xie Guowu
Zarras Apostolis
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2017
Field of study

We present DECANTeR, a system to detect anomalous outbound HTTP communication, which passively extracts fingerprints for each application running on a monitored host. The goal of our system is to detect unknown malware and backdoor communication indicated by unknown fingerprints extracted from a host's network traffic. We evaluate a prototype with realistic data from an international organization and datasets composed of malicious traffic. We show that our system achieves a false positive rate of 0.9% for 441 monitored host machines, an average detection rate of 97.7%, and that it cannot be evaded by malware using simple evasion techniques such as using known browser user agent values. We compare our solution with DUMONT [24], the current state-of-the-art IDS which detects HTTP covert communication channels by focusing on benign HTTP traffic. The results show that DECANTeR outperforms DUMONT in terms of detection rate, false positive rate, and even evasion-resistance. Finally, DECANTeR detects 96.8% of information stealers in our dataset, which shows its potential to detect data exfiltration

Crossref

Profiling user-trigger dependence for Android malware detection

Author: Aafer
Amos
Arp
Barbara G. Ryder
Burguera
Chen
Chin
Cohen
Crussell
Danfeng (Daphne) Yao
Davi
Egele
Elish
Enck
Enck
Forti-guard
Fuchs
Gibler
Grace
Grace
Horwitz
Karim O. Elish
Kim
Liu
Lu
Newsome
Octeau
Peng
Roesner
Sanz
Shabtai
Tan
Wolfe
Xiaokui Shu
Xuxian Jiang
Yang
Yin
Zhang
Zhou
Zhou
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref